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FOREWORD 


Ibis  report  on  artificial  intelligence  technologies  for  language  training 
reviews  the  current  methods  and  tools  available  for  creating  natural  language 
training  systems  on  computers.  Military  managers  need  to  be  aware  of  advanced 
technologies  and  their  application  to  training  problems,  This  review  de¬ 
scribes  the  technology  base  from  which  U.S.  Army  Research  Institute  for  the 
Behavioral  and  Social  Sciences  (ARI)  scientists  are  working  to  develop  foreign 
language  training  systems. 
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B  SUMMARY 


Requirement: 


Die  U.S.  Army  Research  Institute  for  the  Behavioral  and  Social  Sciences 
(ARI)  conducts  research  on  applications  of  advanced  technology  in  training 
systems  for  the  Army.  One  application  involves  developing  natural  language 
processing  capability  on  ccrrputer  systems  for  language  training  and  machine 
translation.  This  report  presents  an  overview  of  natural  language  processing 
technology  applicable  to  these  capabilities. 


Procedure: 

A  literature  review  and  evaluation  of  selected  ocnputer  languages ,  tools, 
and  systems  was  conducted. 


Findings: 

Results  of  the  literature  review  and  system  evaluations  are  described  in 
the  report. 


Utilization  of  Findings: 

The  purpose  of  this  report  is  two-fold.  First,  the  report  describes 
various  computer  tools  necessary  for  developing  a  computer  training  system  for 
the  Army  to  support  retention  skills  in  foreign  languages.  Second,  the  report 
provides  military  personnel  with  a  general  overview  of  natural  language  pro¬ 
cessing  and  artificial  intelligence  technology  as  applied  to  Army  training 
systems. 
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at  Tools  for  Foreign  language  Training 
Introduction 


Several  Amy  military  occupational  specialties  (MOS)  have  a  foreign  language 
requirement.  Training  newly  assigned  soldiers  in  relevant  languages  inpoees  a 
continuing  problem  of  maintaining  productivity  in  the  schoolhouse  and 
naintaining  language  skills  on  the  job.  Soldiers  arriving  from  a  training 
school  often  have  good  general  language  skills  but  little  background  in  a 
specific  job  assignment  in  the  foreign  language.  Subject  matter  knowledge, 
such  as  preparation  of  the  battlefield,  operational  orders,  and  interrogation 
are  unusually  not  covered  in  the  initial  language  instruction  the  soldiers 
receive.  For  the  newly  assigned  soldier, there  is  an  imposing  array  of 
difficult  terminology  and  new  linguistic  structures  to  acquire.  As  a  result, 
additional  language  acquisition  as  well  as  language  Skill  retention  are 
required  of  the  soldier,  coupled  with  learning  new  MDS  related  skills.  However, 
hard-copy  training  materials  to  provide  the  soldier  with  the  additional 
language  training  are  difficult  to  develop  and  maintain  across  the  26  languages 
the  Amy  teaches.  Nevertheless,  several  ocrpelling  technologies  are  reaching  a 
state  of  ready  application  to  solve  these  training  problems. 

Current  advances  in  artificial  intelligence  and  natural  language  processing 
technology  provide  a  unique  opportunity  for  the  development  of  intelligent 
ocmputer-assicted  language  learning  (CALL)  to  support  foreign  language 
instruction  in  the  Amy.  CALL  has  been  used  in  foreign  language  learning  for 
several  years  with  varying  degrees  of  success  (Smith,  1967).  However,  computer 
technology  has  recently  advanced  to  a  point  that  makes  the  presentation  of 
natural,  contextualized  language  use  feasible  in  a  computerized  delivery 
system. 

The  specific  focus  of  our  research  requirement  at  the  U.S.  Amy  Research 
Institute  is  the  development  and  application  of  these  high  technology  advances 
to  the  teaching  of  foreign  languages  in  the  Amy  schoolhouse.  The  principle 
technologies  we  view  as  relevant  to  this  training  problem  are  natural  language 
processing  (NLP)  and  intelligent  tutoring  systems  (ITS).  Tools  and  formalisms 
that  support  this  technology  to  structure  knowledge  as  it  is  presented  and 
acquired  in  an  intelligent  CALL  include  hypertext,  grammar  formalisms, 
parsers,  and  algorithms  and  models  for  presenting  and  teaching  skills  in 
intelligent,  adaptive  environments.  Criticisms  to  CALL  (Smith,  1987)  state 
that  only  certain  of  the  four  basic  language  skillB  (reading,  writing,  speak¬ 
ing,  hearing)  are  supported  in  this  environment.  While  this  argument  is  valid, 
v*  propose  that  for  those  skills  supported  in  CALL  systems,  an  instructional 
foreign  language  environment  built  with  natural  language  processing  representa¬ 
tions  and  using  hypertext  to  structure  the  knowledge  can  improve  language 
skill  acquisition  ani  retention  in  these  areas.  Such  a  CALL  system,  further 
embellished  by  intelligent  tutoring  system  technology  and  even  audio-visual 
support,  with  video  disc  or  (D-RCM,  is  envisioned  as  a  support  to  traditional 
foreign  language  training  in  the  classroom. 

The  foreign  language  tutoring  system  we  envision  will  provide  students  with 
multimedia  interactions  that  support  listening,  reading,  and  writing  skills. 
Students  will  interact  with  the  system  by  typing  in  responses  to  realistic 
oatputer  "dialogues."  The  ccrrputer  will  be  equiped  to  understand  these 
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responses  and  provide  sound  adaptive  instruction  as  needed  by  individual 
students.  Sound  for  language  sanples  will  be  provided  on  oatrrand  to  retain 
students'  listening  skills.  Students  could  be  encouraged  to  speak  and  repeat 
what  is  heard,  although  the  ocwpubar  would  not  be  able  to  process  spoken  input. 
Nevertheless,  intelligent  CAUL  premises  to  provide  Amy  foreign  language 
teachers  with  a  multimedia  language  lab  environment  suitable  for  exercising  the 
student  in  various  drill  and  practice  activities  that  are  prerequisite  to 
active  language  production  skills. 

In  this  report,  we  review  the  tools,  technologies,  and  exponents  for 
developing  intelligent  CALL.  We  close  with  a  discussion  of  an  intelligent 
CALL  prototype  that  exploits  these  technological  advances.  An  appendix 
listing  a  glossary  of  technical  terns  and  their  definitions  is  provided  to 
assist  readers  with  the  oentent  of  this  report. 

Hypertext  Tools  for  Structuring  Instructional  Knowledge 

Hypertext  software  provides  a  text-based  system  for  ccnputer  applications 
that  goes  beyond  simple  text  to  include  graphics,  video,  and  sound  as  well  as 
links  between  cards  of  information  and  cross  references  that  result  in  networks 
or  graphs  of  information  structures.  Hypertext  systems  provide  the  user  with 
a  variety  of  paths  through  the  information  structure.  Command  menus  are 
usually  used  to  help  traversal  through  some  hypertext  space.  A  very  real 
question  worthy  of  further  research  is  how  to  best  organize  the  knowledge  for 
presentation  using  hypertext  software  and  tools.  This  is  especially  critical 
when  hypertext  is  used  to  develop  instructional  systems  (Russell,  Horan,  and 
Jordon,  1988). 

The  structure  of  semantic  memory  has  been  analyzed  by  psychologists  in 
models  of  networks  of  concepts  linked  together  in  a  coherent,  associated 
structure  (Anderson,  1983;  Collins  &  Qui Ilian,  1969).  This  work  is  ideally 
suited  far  cognitive  modeling  in  a  hypertext  ccnputer  system.  Within  such  a 
network  model,  the  structure  of  concepts  and  the  links  between  them  are 
dependent  on  the  relationship  of  features  and  default  values  stored  at  each 
concept  node.  The  question  of  the  psychological  reality  of  these  descriptions 
has  been  subordinated  for  the  time  being  to  the  need  to  explore  the  utility  of 
these  representation  schemes  for  instructional  purposes.  However,  work  cn 
techniques  for  describing  cognitive  structures  using  hypertext  models  is 
proceeding  steadily  within  the  domain  of  cognitive  science. 

For  exanple,  in  the  application  of  hypertext  as  a  tool  for  creating 
oenputer-based  foreign  language  instruction,  we  can  use  hypertext  models  to 
hypothesize  about  the  organization  of  vocabulary  in  memory  (Swartz,  in  press). 
Similarly,  hypertext  can  be  used  to  develop  actual  instructional  modules. 
Hypertext,  coupled  with  more  powerful  natural  language  technologies  on  the 
ccnputer,  promises  to  provide  the  means  for  creating  new,  advanced  technology 
training  systems. 

Natural  Language  Processing  in  Computer  Systems 

Ccmnuni  eating  with  computers  via  natural  language  interfaces  remains  a  very 
difficult  research  problem.  This  is  because  understanding  free- form  language 
input  by  ccnputer  systems  involves  considerable  work  in  developing  databases 
that  have  information  about  the  context,  word  knowledge,  and  linguistics. 
Nevertheless,  current  work  in  natural  language  processing  and  linguistic 
analysis  has  developed  representation  formalisms  and  parsing  strategies  that 
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ray  enable  same  progress  in  solving  the  natural  language  problem.  In  order  to 
develop  such  an  Interface  and  further  extend  it  in  a  ocrputer  environment  that 
can  understand  and  respond  to  not  only  English,  but  any  other  natural  language, 
we  must  first  understand  the  natural  language  itself  and  haw  it  is  used  in 
oamunicaticn.  Thus  we  msst  study  human  conversation,  discourse  processes,  and 
language  comprehension  and  production  skills,  While  this  paper  will  not  review 
the  psycholinguistic  parameters  pertaining  to  foreign  language  learning,  we 
mention  it  as  one  of  the  essential  opponents  in  developing  intelligent  CALL. 
After  all,  how  can  we  represent  natural  language  in  a  ocrputer  system  without 
an  understanding  of  language  use  in  situ? 

Natural  language  understanding  computer  systems  have  been  developed 
(Winograd,  1972, )  and  mutfi  recent  work  has  occurred  in  this  area  to  exploit  the 
earlier  advances  (Allen,  1987).  A  computer  system  that  can  not  only  understand 
but  generate  language  must  be  capable  of  several  essential  functions.  The 
system  nust  perform  1)  lexical  analysis  (vocabulary  understanding),  2) 
morphological  analysis  (word  formation),  3)  syntactic  analysis  (grammar 
understanding),  4)  semantic  analysis  (contextual  maiming) ,  and  pragmatic 
analysis  (language  use  in  context).  These  analytic  processes  depend  not  only 
on  language  understanding,  but  on  real  world  knowledge  and  context  in  which  to 
ground  any  potential  language  utterance.  In  a  ccnputar  system,  this  requires 
that  specific  kinds  of  knowledge  necessary  for  thews  analysis  procedures  be 
adequately  represented.  Indeed,  we  have  described  previously  how  knowledge 
representation  with  hypertext  systems  for  language  .instruction  is  of  critical 
importance.  However,  the  domain  knowledge  to  be  taught,  the  language  or 
"granmar",  must  also  be  represented  in  sene  formalism  so  that  the  computer  can 


variety  of  granmar  representations  which  address  this  issue  of  formalisms  in 
natural  language  processing  applications.  These  strategies  will  be  briefy 
explained  be  lew. 

The  representation  of  linguistic  knowledge  alone  without  contextual 
information  and  a  process  to  understand  the  language  will  not  solve  the  natural 
language  problem  for  Intelligent  CALL,  however.  We  also  need  a  mechanism  for 
matching  the  language  input  a  student  provides  with  the  granmar  representation 
in  the  computer  system.  This  process  is  oallsd  "pursing"  and  we  will  see  that 
different  kinds  of  parsing  can  be  performed  depending  cn  the  context 
surrounding  the  language  input  and  system-dependent  efficiency  methods  for  the 
particular  parsing  strategy  used. 

Technological  advances  in  this  subdiscipline  have  provided  powerful 
fbrmalimc  for  representing  natural  languages  in  computer  systems.  These 
representations  premise  to  address  the  issues  of  lexical,  morphological, 
semantic,  and  syntactic  analysis  required  by  an  intelligent  CALL  environment. 
Parsing  strategies  vhidh  access  and  manipulate  the  language  representations 
will  address  syntactic,  semantic,  and  pracpntic  (to  acme  degree)  analysis.  An 
important  question  is  which  combination  of  granmar  formalisms  and  parsing 
strategies  will  provide  a  suitable  learning  environment  in  intelligent  CALL? 

By  finding  the  best  combination,  we  can  develop  a  tutoring  system  that 
provides  quick,  correct  analysis  of  student  input  in  the  foreign  language.  The 
decision  on  what  combination  of  granmar  and  parser  to  use  is  based  on  several 
criteria.  These  criteria  include  providing  a  system  that  is  psychologically 
realistic.  Far  exanple,  the  computer  should  be  able  to  parse  "colorless  green 
ideas  sleep.  "  and  inform  the  student  that  while  granmatically  correct,  the 
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sentence  is  meaningless.  Secondly,  instructional  goals  for  the  system  will  be 
used  to  decide  hew  comprehensive  a  grainier  is  required  or  how  detailed  a  parse 
is  required  (i.e. ,  is  toy-word  matching  enough  or  must  the  system  understand 
complex  grammar  structures?) ,  Thus  we  review  several  current  grammar 
formalisms  and  parsing  irtrategies  below  to  provide  an  overview  of  these 
technologies. 

Grammar  Representation1; 

Semantic  Grammars.  A  semantic  grammar  is  based  on  a  simplified  conceptual 
depenciancy " construct  (lurton  and  Brown,  1979).  The  grammar  is  organized  as  a 
set  of  attribute- specific  semantic  categories  far  a  given  domain.  This 
construct  has  been  used  in  early  natural  language  applications  for  handling 
question  asking  about  none  specific  topic.  In  NLP  the  granrnar  defines  the 
allowable  patterns  that  can  be  matched  with  a  given  input  string.  These  legal 
patterns  restrict  the  form  of  the  natural  language  which  cuts  down  on  arrbiguity 
and  provides  surprisingly  good  language  coverage.  A  sample  semantic  granrnar 
and  a  query  to  parse  might  lock  like  the  following: 

<e>  - >  What  is  <ship-prcperty>  of  <ship>? 

< ship-property > - >  the  <  ship-prop  >  <  ship  |  prep > 

<  ship-prop  >  - >  iipeed  length  type 

<ship>  - >  <ship-name> 

<  ship-name  >  - >  Kennedy  |  Kitty  jHaWk 

Query:  What  is  the  length  of  the  Kitty  Hawk? 

A  semantic  granrnar  is  context-free  and  can  be  parsed  by  any  of  the  several 
existing  context-free  parsers  (Earley,  1970 ) .  Context-free  means  that  word 
order  is  not  taken  into  account.  The  contextual  semantics  or  meaning  of  a 
given,  ordered  input  string  are  derived  from  the  word  categories  in  the 
gr aimer .  Key-word  and  pattern-matching  strategies  are  used  to  perse  and  answer 
queries.  While  thin  mechanism  allows  for  natural  language  interaction,  there 
is  no  flexibility  for  syntactic  deviations  in  question  form.  Lexical 
variations  will  also  oe  oonsljrained  to  the  vocabulary  represented  in  the 
semantic  grarimx. 

In  order  ix>  9*t  :more  flexibility  in  natural  ^language  input  and  parsing 
strategies  tJian  is  provided  in  a  semantic  grammar,  asperate  representations 
will  need  to  be  developed  to  represent  the  lexicon,  (word  dictionary), 
morphological  tables  (word  formation),  phrase  structure  rules  (rules  used  to 
derive  sentences  from  input  strings),  and  granrnar.  This  brings  our  discussion 
to  another  set  of  natural  language  processing  representations.  While  the 
unification  formalisms  to  be  discussed  next  allow  for  rich  natural  language 
interaction  with  computer  systsre,  they  pose  a  different  kind  of  problem  in 
parsing  in  regard  to  semantic  and  pragmatic  language  contexts. 

Unification  Formalisms.  Granrnar  formalisms  axe  representations  used  to 
describe  a  language,  its  set  of  sentences,  the  syntactic  features  of  the 
sentences,  and  their  semantics.  These  formalisms  have  been  constructed  to  help 
linguists  understand  universal  linguistic  theory  as  well  as  provide  a  means  for 
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A  LEXICAL-FUNCTIONAL  GRAMMAR  REPRESENTATION 


ccrputaticnal  interpretations  of  the  graimar  being  studied  when  it  is  represen¬ 
ted  in  sate  ecnputer  system.  A  unification  formalism  is  one  that  prescribes  a 
given  operation  (pattern-  Hatching)  between  an  input,  string  (a  sentence)  and 
the  graiimar.  Schreiber  (1986)  discusses  the  essential  assuiptions  of  a 
unification  formalism: 

1)  it  is  surface-based  with  a  direct  relation  to  the  surface  carder 
of  the  sentential  elements 

2)  it  associates  the  sentence  string  elemea.  a  with  seme  relevant 
information  (knowledge)  domain 

3)  it  is  inductive,  defining  the  association  between  an  element 
and  the  information  domain  recursively  and  according  to  a 
specified  inference  procedure 

4)  it  is  declarative,  defining  permissible  associations,  not  how 
the  association  is  computed 

5)  it  is  context-feature-based  in  that  associations  between  features  and 
values  are  taken  from  a  well-defined,  structured  set 

Exairples  of  unification-based  grammar  formalisms  include:  Functional 
Unification  Grammar  (FUG),  Lexical-Functional  Grammar  (IPG),  General  Phrase 
Structure  Grammar  (GPSG),  and  Definite  Clause 'Grammar  (DOG).  Each  are  briefly 
defined  below.  For  more  information  the  reader  is  referred  to  the  references 
cited. 

The  FUG  representation  uses  functional  structures  as  generalized 
feature/value  pairs  (features  verb,  value:  to  see) to  denote  linguistic 
information  in  some  language  input.  Patterns  and  constituent  sets  are  used  in 
the  matching  or  unifying  of  the  sentence  elements  with  the  granmar.  FUG  also 
introduces  disjunction  (allowing  seperate  rule  analysis)  into  the  grammar  which 
allows  far  getting  around  any  potential  constraints  imposed  by  simultaneous 
satisfaction  of  all  applicable  rules.  In  contrast  to  the  LFG  formalism  below, 
FUG  deliberately  blurs  the  distinction  between  constituent  and  functional 
structures  (Dowt.y,  Karttunen,  and  Zwickey,  1985).  This  formalism  emphasizes 
the  functional  description  of  the  language  with  the  constituent  structure 
specified  by  a  means  of  patterns  for  noun  and  verbal  elements  rather  than 
phrase  structure  rules. 

LFG  was  developed  primarily  as  a  linguistic  theory  (Bresnan,  1982)  and  then 
became  recognized  as  a  useful  formalism  for  representing  natural  languages  in 
oonputer  systems.  IFG  is  based  on  a  model  of  syntax  that  is  not  completely 
structurally-based  (Sells,  1985).  The  formalism  describes  grammatical 
functions  (subject,  object)  which  are  represented  by  functional  (F~)  structures 
in  the  grander.  An  F-structure  for  a  given  sentenoe  describes  the  appropriate 
grammtioal  functions  or  role  for  the  sentence  constituents  ( "Jam/"  mi#it 
function  as  a  subject  in  one  sentence  and  an  object  in  another) .  Constituent 
(C-)  structures  (noun  phrase,  verb  phrase)  are  the  part  of  the  formalism  that 
represents  the  syntax  of  r  sentence  (see  Figure  l)o  The  lexical  ccnpcnent  of 
the  theory  enphasizes  the  commitment  of  LFG  to  characterize  semantic  relations 
between  constituent  elements.  LFG  can  thus  analyze  grarrnrticality  at  the 


lexical  level,  'mis  formalism,  nore  than  the  others  described  here,  has  been 
used  to  investigate  peydholinguistic  issues  in  relation  to  language  processing 
systems  (Ford,  Bresnan,  Kaplan,  1981). 


GPSG  also  came  into  being  from  a  linguistic  analysis  _ 

motivation.  This  formalism  attenpts  to  retain  a  formally  restrictive  system  of 
surface  feature  structures  that  can  handle  a  wide  variety  of  semantic  and 
syntactic  phanomna.  Unification  procedures  used  to  analyse  language  input  are 
nljn  hi<4*ly  restructured.  A  given  phrase  structure  tree  for  a  sentence  will 
satisfy  various  rules  in  the  GPSG  grammar  baaed  upon  specific  principles 
(control  agreement,  head  feature)  specified  in  the  formalimn. 

DOG  arose  frcm  work  in  the  PROLOG  programming  language  (the  other  formalisms 
mnticned  above  are  generally  written  in  LISP) .  This  formalism  uses  term 
structures  rather  than  feature  structures  for  a  givsn  sentence.  T"[rns  ar®  the 
informational  elements  in  DOG  with  the  unification  principles  derived  from  the 
theorem-proving  nechaniam  inherent  in  HOLOG.  Examples  of  terms  are. 

s  (head  (subjHead,  farm))) 


np  (subjhead) 

rp  (head  (subjHe&d,  form))) 

Term*  differ  from  the  feature  structures  in  FUG,  IPG  and  GPSG  in  that  they 
identify  sentence  constituent  values  by  their  order  in  the  term  structure 
rather  than  their  association  with  the  feature. 


Parsing  Strategies 

Parsing  is  "searching  a  space  of  possibilities"  in  an  input  string  (Kay, 
1985)  as  the  string  is  matched  with  the  grammar.  The  discussion  of  parsing  cn 
ooncuter  systems  will  be  facilitated  by  sans  simple  definitions  taken  from 
Earley,  (1970).  A  language  is  a  set  of  strings  made  up  of  s  sequence  of 
syrrbols.  These  are  referred  to  as  terminal  synbols  that  rspreaant  the  words 
defined  in  the  language.  A  oontext??rse  graimar  is  a  formal  device  that 
specifies  the  strings  in  a  given  aat.  This  granxmr  also  uses  non-terminal 
synbols  that  represent  syntactic  classes  of  words.  A  sentence  is  a  string  of 
several  terminal  and  non-terminal  synbols.  When  parsing  occurs,  an  algorithm 
to  search  the  string  and  make  retches  between  its  elements  and  the  granmar  is 
performed.  There  are  many  strategies  one  can  use,  transition  network  parsing, 
top-down  parsing,  bottom-qp  parsing,  and  deterministic  parsing.  Which  approach 
is  used  depends  cn  the  domain  and  the  goal  of  the  natural  language 

Bers^we^are  *  interested  in  the  needs  of  an  instructional  system  for  foreign 
language  that  requires  quick,  accurate  parsing. 

An  augmented  transition  network  (ATM)  (Woods,  1970)  is  one  way  to  parse 
language  going  frtm  an  initial  state  (the  first  word  in  a  string)  to  a  final 
atate  (the  last  word  in  a  string).  ATM's  use  the  nodes  in  a  network  of 
language  elements  to  represent  parts  of  speech  or  phrase  names.  The  arcs 
between  nodes  are  used  to  implement  certain  actions  when  they  are  transversed 

(See  Figure  2). 
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Push  NP 


Figure  2.  This  ATM  illustrates  the  states  and  transition  arcs 
for  parsing  a  noun  phrase  with  a  prepositional  phrase.  The 
circles  represent  states.  The  noun  phrase  has  an  article/  possible 
adjectives  (the  ADJ  loop  arc  on  the  middle  state) ,  and  a  noun  with 
a  loop  for  possible  prepositional  phrases.  The  second  part  of  the 
ATN  shows  the  states  and  transition  arcs  for  parsing  the  prespas tianal 
phrase. 
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Augmenting  such  a  network  allows  for  three  kinds  of  transitions: 

1)  Adding  arbitrary  tests  to  the  arcs  such  as  agreement  of  a  word  and  its 
modifier. 

2)  Adding  structure-building  actions  to  arcs  to  be  used  by  the  parser  to 
determine  semantic  analysis  or  active-passive  sentence  transformations. 

3)  Allowing  subroutines  to  be  called  on  arcs.  For  example,  in  Figure  2, 
a  preposition  (PP)  is  attached  to  a  noun  phrase  by  a  subroutine  call  to 
PP. 

Utile  the  ASM  formalism  provides  imny  advantages  for  its  use  in  natural 
language  (conciseness  of  the  representation  and  conceptual  effectiveness)  and 
can  deal  with  complex  linguistic  phenomena),  it  has  a  high  cost  for  processing 
speed.  This  is  because  AIR's  are  interpreted  processes.  Interpreted  processes 
occur  in  real  time  and  nay  be  frusta  ting  for  students  since  more  time  to  parse 
AID'S  is  needed. 

Top-down  parsing  is  based  on  the  phrase  structure  rules  it  uses  and  is 
expectation  or  goal-driven.  ‘Given  a  string,  the  parser  starts  by  applying  a 
particular  phrase  structure  rule,  the  goal,  to  elements  in  the  input  string. 
Conversely,  bottcm-up  parsing  is  data-driven.  This  procedure  starts  with  words 
in  the  input  string  and  locks  for  phrase  structure  rules  that  will  allow  a 
parse.  Figure  3  illustrates  these  two  approaches  schematically. 

Marcus  (1960)  developed  a  deterministic  parser  (vhere  only  one  next  step  is 
allowed  far  a  given  input)  to  avoid  the  backtracking  problem  required  when 
sentence  inputs  involved  canplex  clause  construction  and  potential  "garden 
path"  or  ambiguous  parses.  This  means  that  the  computer  would  have  to  keep 
track  of  the  terminals  in  a  string  and  "backtrack"  to  these  syntools  in  case  the 
sentence  being  parsed  might  have  more  than  one  meaning.  Other  new  parsing 
strategies  being  developed  include  the  use  of  the  special  features  of  logic 
programming  in  PROLOG  to  execute  parses  from  a  DCG  representation.  PROLOG 
handles  backtracking  problems  very  easily. 


With  this  brief  overview  of  grairmar  formalisms  and  parsing  strategies,  we  can 
see  that  the  technology  exists  for  handling  natural  language  cn  computer 
systems.  In  order  to  understand  how  this  technology  can  be  used  to  develop 
intelligent  CALL  for  foreign  language  training,  however,  we  turn  to  a 
discussion  of  the  technology  available  for  creating  intelligent  tutoring 
systems. 


Intelligent  Tutoring  Systems  (ITS)  for  CALL 

Over  the  past  five  years,  AR1  has  accumulated  a  great  deal  of  practical 
knowledge  about  expert  systems  and  intelligent  conputer-assisted  instruction. 
This  work  has  carried  out  studies  and  development  efforts  that  have  created 
working  ITS  in  the  domains  of  electronic  troubleshooting,  computer  programming, 
and  technical  training  (Simutis  and  Psotka,  1987;  Psotka,  Massey  and  Mutter, 
1988).  Fran  these  efforts,  it  is  safe  to  say  that  several  of  the  technology 
base  domains  are  well  developed  and  ready  to  apply  in  broad  Amy  mission  areas 
(Fletcher  and  Psotka,  1986).  As  ITS  becomes  more  practical  and  effective, 
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their  applications  are  moving  from  more  well  understood  and  tractable  domains 
(electronics  troubleshooting,  math)  (Sleeran  and  Brown,  1982)  to  fuzzier 
complex  skill  areas  such  as  foreign  language  learning.  However,  in  addition  to 
having  the  ccrrputaticnal  resources  to  develop  these  system,  we  reed  to 
understand  what  goes  on  in  good  tutoring  from  the  standpoint  of  the  tutor,  the 
knowledge  taught,  and  the  student. 

A  good  human  tutor  is  an  expert  in  the  dcimin  being  taught  and  an  expert 
pedagogue.  Being  a  good  pedagogue  implies  a  number  of  sophisticated  skills.  A 
good  tutor  is  able  to  accurately  monitor  each  trainee's  knowledge  state.  That 
is,  the  tutor  is  able  to  determine  what  each  trainee  knows,  does  not  know,  and 
knows  Incorrectly.  The  seoond  part  of  that  skill,  knowing  vftiere  trainees 
typically  go  wrong,  has  turned  out  to  be  the  key  to  developing  good  ITS.  The 
tutor's  knowledge  of  the  trainee's  level  of  knowledge  extends  in  time  allowing 
the  tutor  to  make  predictions  about  instructional  level  and  pacing.  The  tutor 
has  an  extensive  set  of  knowledge  resources  i  bocks,  graphics,  films, 
carefully  prepared  courses  of  instruction,  and  the  skill  to  vise  these 
appropriately.  Finally,  the  tutor  is  able  to  narshall  all  these  resources  and 
skills  in  a  carefully  coordinated  interaction  with  the  trainee.  These  aspects 
of  a  good  human  tutor  are  analogous  to  the  nodules  that  make  up  an  ITS:  the 
expert  model,  the  pedagogical  nodal,  the  student  model,  and  the  interface .  The 
first  three  will  be  discussed  here  in  relation  to  developing  intelligent  CALL. 

The  Expert  Ftodel 

The  expert  model  in  an  ITS  represents  the  domain- specific  knowledge  and  the 
inferencing  knowledge  or  reasoning  processes  involved  in  solving  problems  in 
the  instructional  domain  (Clancey,  1986).  As  part  of  the  domain- specific 
knowledge,  a  rich  knowledge  base  of  facts  and  skills  the  student  will  need  to 
learn  is  represented.  Often  these  representations  take  the  form  of  semantic 
networks.  The  inferencing  knowledge  is  a  representation  of'  the  procedures  an 
expert  in  the  domain  would  use  to  reason  about  or  apply  to  the  domain-specific 
knowledge  when  solving  a  problem.  This  kind  of  procedural  knowledge  often 
takes  the  form  of  production  rules. 

In  foreign  language  learning,  the  form  of  the  dcmein- specific  knowledge  in 
the  expert  model,  a  particular  foreign  language,  would  be  represented  fcy  one  of 
the  grammatical  formalisms  (FUG,  DCG,  LFG,  GPSG)  described  above.  The  selected 
formalism  would  represent  the  grammatical  rules  and  lexicon  items  with 
requisite  features  and  properties  that  describe  the  semantics  and  syntax  of  a 
given  language.  This  domain-specific  knowledge  is  What  a  native  speaker  of  the 
language  knows.  It  would  change  from  language  to  language.  However,  world 
knowledge  that  is  needed  to  understand  the  contexts  of  the  language  in  use 
must  also  be  represented  and  tied  in  with  the  expert  grammar.  ‘This  knowledge 
might  be  represented  in  a  semantic  network  or  script-like  representation  to 
provide  the  context  or  situation  in  vhidh  the  language  to  be  taught  is  used . 

The  inferencing  knowledge  in  the  expert  model  far  a  foreign  language  tutor 
would  be  a  particular  parsing  strategy  that  would  be  applied  to  the  granviar  and 
lexicon  to  both  generate  and  understand  language  within  a  given  oontext.  These 
parsing  strategies  would  most  probably  take  the  form  of  rules  and  be  a  separate 
knowledge  structure  within  the  expert  model. 
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The  Student  Model 


The  student  node!  oarponent  in  ITS  is  the  information  that  describee  a 
student's  knowledge  about  vhat  is  being  taught  and  that  allows  the  tutor  to 
adapt  subsequent  instruction  to  the  student's  needs  (Clancey,  1986).  Self 
(1974)  refers  to  a  student  model  ae  a  set  of  programs  designed  to  represent  a 
student's  knowledge  state.  VanLehn  (1988)  specifies  that  two  components  make 
vp  a  student  model «  the  structure  of  representation  of  the  knowledge  in  the 
nodei ,  and  the  process  that  manipulates  or  transforms  the  structure.  He 
further  defines  these  two  oarponent s  as  the  model  itself  (structure),  and 
diagnosis  (process)  as  the  mechanism  for  transforming  the  student  model. 
Knowledge  state  assessment  is  therefore  the  cutocrae  of  this  process.  Hew  the 
student  nodel  is  fbmulated  depends  on  measuring  performance,  identifying 
errors  and  nit  conceptions,  and  caiparing  this  knowledge  to  that  represented  in 
the  expert  model  of  the  target  domain.  All  of  these  parameters  are  contingent 
upon  how  the  detain  knowledge  to  be  taught  is  represented  in  the  expert  model 
of  the  tutoring  system.  Since  foreign  language  learning  is  the  domain  of 
interest,  the  student  model  structure  would  be  some  subset  of  knowledge  as 
represented  in  the  grammar  and  lexicon  in  the  expert  model.  Because  the 
student  nodel  is  derived  from  the  interaction  between  the  student,  the  expert 
model,  and  the  pedagogical  model  in  the  tutoring  system,  the  form  of  the 
representation  of  these  individual  models  as  well  as  the  diagnostic  process 
knowledge  used  for  modeling  a  student  are  critical  to  the  success  of  an  ITS  in 
the  modeling  task.  Several  methods  for  enabling  the  diagnostic  process  exist 
such  as  differential,  overlay,  and  model  tracing  techniques.  Clancey  (1986) 
provides  a  good  overview  of  these  methods  in  existing  systems. 

The  Pedagogical  Model 

This  nodel  repreeents  the  pedagogical  strategies  and  lessens  to  be  presented 
to  the  student.  In  addition,  the  diagnosis  process  plays  a  role  in  this  model 
so  that  the  computer  tutor  can  present  remedial  lessons  based  on  the  diagnoses 
of  student  errors.  This  component  of  the  ITS  generates  instruction  based  upon 
the  student's  instructional  history  during  a  session  with  the  computer.  The 
pedagogical  nodel  uses  information  fron  the  diagnostic  process  to  identify  a 
student's  particular  instructional  need  and  then  adapts  the  instruction 
accordingly.  This  activity  uses  sophisticated  planning  techniques  (Russell, 
1988)  to  change  pedagogical  strategies  and  tactics  according  to  a 
student's  performance  in  a  lessen.  This  adaptive  approach  differs  drastically 
from  more  traditional  oerputer-based  training  (CBT)  vhich  presents  students 
with  prestored,  frame-based  sequences  of  instruction. 

The  pw^g^ioti  strategies  and  tactics  that  nust  be 
represented  in  a  foreign  language  tutor  should  represent  methods  appropriate 
for  teaching  in  CALL  environments  and  that  are  sensitive  to  the  particular 
learning  strategies  involved  in  language  learning.  More  importantly,  this 
instructional  knowledge  must  follow  a  principled,  theoretically  grounded 
approach  to  the  lai.guage  learning  process.  Research  on  hew  to  rationalise 
such  an  approach  for  designing  language  instruction  is  currently  underway 
(Swartz  and  Russell,  1989). 
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Prototype  Intelligent  CALL 

We  have  described  the  existing  technology  and  tools  that  are  available  for 
creating  intelligent  CALL.  But  how  are  these  complex  formalisms  and  tutoring 
nodules  to  be  integrated  into  a  single  coherent  instructional  system?  The 
following  is  an  overview  of  one  such  prototype. 

A  fair  amount  of  basic  research  has  been  deployed  with  the  goal  of  using  NLP 
techniques  for  language  instruction.  Barchan,  Woodmans ee,  and  Yazdani  '1986) 
provide  a  short  review  of  the  topic.  An  early  attempt  to  use  NLP  techniques 
for  language  instruction  (Feuerman  et  al.,  1987)  has  shown  great  promise.  He 
system  use»  a  Spanish  grammar  written  in  the  IPG  formalism  to  drive  an 
interactive  dialogue-based  tutoring  system  in  beginning  Spanish.  The  choice  of 
IPG  is  fairly  important  although  many  other  systems,  such  as  FUG  and  GPSG  now 
have  very  similar  characteristics.  The  functional  parsing  approach  used  fcy  IPG 
has  a  clear  advantage  over  many  other  formalisms  in  that  it  gets  all  its 
semantic  and  syntactic  meaning  iron  the  representation  of  the  lexicon.  This 
factor  saves  computational  effort  for  parsing  input  strings  when  compared  to 
other  formalisms.  Thus  the  interface  the  student  uses  provides  timely 
responses  to  input  typed  into  the  system. 

Feuexman's,  et  al,  Computer-Assisted  Language  Learning  Envinrcnment  ( CAT J £ ) 
capitalizes  on  the  use  of  hypertext  to  present  the  student  with  various 
instructional  interactions.  CM. IE  engages  trainees  through  a  number  of 
functional  hypertext  windows  shown  in  Figure  4.  The  main  dialogue  window 
provides  the  Key  functionality:  cn-line  exercises  in  Which  trainees  can 
engage  in  realistic  foreign  language  dialogues  with  the  machine.  These 
dialogues  are  monitored  fcy  special  rules  to  determine  Which  immediate  lessen 
goals  have  been  exercised,  and  which  remain  to  be  tested.  The  system 
deliberately  generates  dialogue  contexts  to  exercise  the  remaining 
instructional  goals  as  the  "conversation"  proceeds.  Other  window  allow 
trainees  to  use  the  foreign  language  without  being  directed  by  the  machine's 
specific  dialogue  goals  (the  “Try  It  Window"),  but  with  diagnostic  comments. 
Additional  tools  exist  for  the  trainer  to  create  scripts  to  drive  the  dialogue 
and  new  rules  for  presenting  additional  aspects  of  the  language.  Although 
CM1E  does  not  have  a  sophisticated  pedagogical  model  fcy  ITS  standards,  we  view 
it  as  a  precursor  for  creating  a  complete  ITS  with  robust  tutoring  strategies 
and  adaptive  presentation.  While  student  errors  are  Identified,  CALLE  lacks 
true  student  modeling  capability.  Nevertheless,  the  basic  elements  of  a  robust 
tutoring  the  system  are  present  and  can  be  upgraded. 

CONCLUSION 

The  technologies  reviewed  here  indicate  that  the  time  is  ripe  for  research 
and  development  of  advanced  technology  in  foreign  language  training  systems  for 
the  Army.  ARI  is  currently  actively  engaged  in  such  work. 

CALLS  was  developed  as  a  prototype  tutor  of  linguistic  knowledge  using  a 
combination  of  natural  language  processing,  hypertext,  and  ITS  technology  to 
demonstrate  the  feasability  of  such  an  approach  for  language  training.  It  was 
designed  to  strengthen  the  procedural  skills  of  using  a  language  in  natural 
discourse  and  to  encourage  the  self-correction  of  a  trainee's  grammatical  rules 
and  understanding  in  an  interactive  environment. 
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Figure  4.  The  windows  shc**i  here  Illustrate  a  typical  dialogue 
exchange  between  the  tutor  and  student.  Errors  are  flagged  and 
sinple  feedback  provided.  Other  windows  let  the  student  see  the 
parse  tree  for  a  phrase  and  even  its  functional  structure. 


While  CAT  IE  is  not  a  complete  foreign  language  ITS,  it  does  demonstrate  the 
potential  for  such  a  tutoring  eystem  for  the  Army  schoolhouse  language  skill 
retention  curriculum.  Current  research  at  ARI,  both  contractual  and  in-house, 
continues  to  investigate  these  technologies  and  systems  for  developing  intelli¬ 
gent  CALL.  Not  only  will  we  explore  the  potential  of  these  technologies  for 
language  learning,  but  basic  cognitive  and  psycholinguistic  research  is 
planned  to  assess  the  usefulness  of  structuring  knowledge  for  instructional 
presentation  and  its  effect  on  the  recall,  retention,  and  use  of  language 
within  robust,  advanced  technology  training  systems. 
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APPENDIX  A 


Glossary  of  Technical  Tams 

AIN  (Augurrerrted  Transition  Network)  -  This  is  cne  of  several  parsing  strategies 
used  to  assign  meaning  to  language  input  on  sane  ccnputer.  An  AIN  is  trade  up 
of  a  set  of  transiticn  netvorks  with  various  nodes  for  representing  components 
of  language.  For  example,  a  network  may  have  a  node  for  a  noun,  another  for  an 
adjective,  a  verb  or  a  preposition.  The  nodes  of  the  network  are  connected  by 
links  or  arcs.  Conditions  associated  with  the  arcs  are  used  to  augnent  the 
network  as  sene  language  string  is  analyzed.  For  example,  one  condition  might 
be  "assign  subject".  When  a  transiticn  is  made  front  one  node  (a  noun)  or  state 
to  another,  and  the  condition  is  met,  then  the  ATN  can  assign  the  category 
"subject"  to  the  noun  analyzed. 

Backtracking  -  This  is  a  search  network  vised  by  parsers  to  solve  ambiguity  When 
a  sentence  may  have  multiple  meanings.  If  a  node  in  some  network  is  visited 
but  does  not  meet  a  specified  condition  (a  failure) ,  this  search  technique 
allows  the  next  adjacent  node  to  be  visited.  Backtracking  lets  the  ccnputer 
"go  back"  and  search  alternative  derivations  for  a  sentence. 

Bottam-up  Parsing  -  This  type  of  parsing  is  data-driven.  The  parser  begins  by 
locking  at  the  constituent  values  (i.e. ,  "bey",  "runs")  in  a  sentence  (the 
data)  and  matches  them  to  their  nonterminal  synbole  (i.e.,  "noun",  "verb"). 

No  predictions  ere  made  and  the  parser  oust  go  throu^i  an  analysis  of  all 
oenstitutents  in  a  string  before  generating  a  legal  parse.  This  means  that 
this  strategy  alone  can  be  inefficient  since  wrong  parses  can  be  generated  en- 
route  to  the  final  solution. 


Constituent.  Structure  -  A  sentence  is  made  up  of  various  constituents,  nouns, 
verbs,  adjectives.  The  constituent  structure  for  a  particular  sentence  is 
represented  as  a  hierarchical  tree  form  of  phrasal  elements  that  proceed  fran 
the  top  (sentence)  to  intermediate  level  (noun  phrase,  verb  phrase)  to  lower 
level  (nouns,  verbs,  adverbs,  prepositions,  determiners)  constituents. 

Context-free  Grammar  -  A  granrar  that  oonsists  of  a  set  of  rewrite  rules  each 
of  Which  has  a  left-hand  side  and  a  right-hand  side  separated  by  soma  symbol , 
generally  — >.  The  left-hand  side  of  each  rule  is  a  nonterminal  sysrrtol  of 
the  granrar;  the  right-hand  side  is  a  sequence  of  nonterminal  symbols  and 
terminal  symbols.  Nonterminal  symbols  are  usually  surrounded  by  angle 
brackets,  for  example;  <  symbol  >  — >  <  one  or  more  symbols  >. 

Deterministic  Parsing  -  This  parsing  strategy  doesn't  "lock  ahead"  or  consider 
other  search  alternatives  vhile  parsing  a  string.  A  deterministic  parser  uBes 
the  information  available  at  the  time  of  analysis  of  seme  sentence.  Since 
alternative  search  methods  are  not  used,  only  one  meaning  for  a  sentence  is 
generated. 

Expert  Model  -  A  representation  or  knowledge  base  in  an  ITS  for  the  domain 
skill  (i.e. ,  foreign  language,  radar  mechanics) .  Knowledge  is  represented  as 
rules  or  schwas,  for  example.  The  expert  model  knowledge  is  used  to  teach 
students  in  ITS  lessons. 

functional  Grammar  -  A  list  of  rules  that  provide  a  functional  description 
(subject,  object,  preposition)  for  elements  in  seme  language  string.  These 
rules  of  functions  are  made  up  of  a  set  of  attributes  and  values.  Each 
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attribute  value  pair  represents  a  function  car  grtsmatical  relation.  Far 
example,  an  attribute,  SUBJECT  might  have  a  value,  PRCNCXJN. 


Grammar  -  A  set  of  rules  that  determines  Which  strings  of  words  are  legal 
sentences  in  a  language.  These  rules  must  also  specify  the  syntactic  structure 
for  the  sentences.  Generally,  phrase  structure  rules  are  used  to  define  a 
grannar. 


Intelligent  Tutoring  Systems  (ITS)  -  An  advanced  form  of  ocrputer-assisted 
instruction  that  adapts  instruction  to  the  individual.  Artificial 
Intelligence  technology  is  used  to  represent  an  expert  model  of  the  knowledge 
domain  and  a  student's  developing  model.  Both  models  are  carpared  during 
learning  and  instruction  adapted  accordingly  so  the  student's  knowledge  will 
approach  that  of  an  expert  in  the  field. 

Nonterminal  Symbol  -  An  element  that  represents  a  syntactic  word  class  in  a 
language.  For  exanple,  noun  (N)  and  verb  (V)  are  nonterminal  synbolB. 

Parsing  -  Parsing  is  an  algorithmic  operation  performed  by  some  device  (hmmn 
or  ira  chine)  on  seme  representation  of  a  natural  language  (i.e.,  a  sentence). 
The  result  is  the  computation  and  assignment  of  structural  relations  between 
words  after  they  have  been  recognized. 

Pedagogical  Model  -  This  representation  is  used  to  present  lessons  to  students 
in  ITS  based  on  error  information  and  student  performance.  This  model 
represents  the  knowledge  of  an  expert  tutor  according  to  a  specified 
instructional  strategy  or  teaching  principles. 
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ttirase  Structure  Rules  -  Each  rule  is  made  up  of  one  oar  more  expressions 
(phrase  oar  single  words)  that  represent  the  structure  of  sens  sentence,  The 
left-hand  side  of  the  rule  is  trade  up  of  one  or  nore  expression*  that  define 
the  expressions  one  the  ricfrit-hand  side.  Expressions  are  placed  on  the  right- 
hand  side  of  the  rule  in  a  particular  order.  For  exanple, 

S  — >  NP  VP  (Sentence  His  nade  up  of"  Noun  Riraae  +  Verb  Ihrase) 

NP  — >  BET  N  (Noun  Fhraae  "is  made  up  of"  determiner  +  noun) 

Semantic  Granrnar  -  This  grammar  uses  ssnantio  categories  far  terminal  symbols 
rather  than  a  rule.  An  exanple  is  presented  in  the  report.  These  gr aimers 
carbine  syntactic  and  semantic  processing  within  a  single  framevrork,  either  a 
context-free  gramrar  or  an  A3N. 

Semantic  Network  -  A  data  structure  for  representing  factual  knowledge.  In 
computer  science  and  psychology,  the  structure  takes  the  form  of  a  graph  in 
vhidi  the  nodes  represent  concepts  and  the  arcs  or  links  between  nodes 
represent  relationships  among  concepts. 

Student  Model  -  Strategies  used  in  ITS  that  ocrpare  student  performance  or 
errors  with  a  representation  of  the  expert  knowledge  base  or  model.  Overlay, 
differential  modeling,  and  nodal  tracing  are  axanplea  of  three  student  modeling 
strategies. 

Syntax  -  This  is  concemsd  with  the  structure  of  the  strings  of  syntols  that 
make  up  the  sentences  of  a  language. 
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-  An  element  that  represents  an  actual  word  defined  in  the 


language.  Far  exanple,  "adjective"  and  "soldier"  are  both  terminal  systole  in 
Ehgliah. 

TOp  -  down  parsing  -  This  type  of  parsing  is  goal-driven.  This  strategy 
expands  a  rule  from  the  tap  node,  for  example,  "S"  and  matches  this  symbol  with 
the  next  lower  level  symbol*  in  the  rule,  far  example,  "KP"  "VP".  Next  these 
intermediate  nodes  are  matched  to  the  lowest  level  of  values  or  terminals 
symbols.  A  top-down  parser  must  predict  what  constituents  a  sentence  will 
contain  based  on  the  surface  phrase  structure  rules  (i.e. ,  S  ->  NP  VP).  This 
method  continues  until  all  elements  in  a  string  are  assigned  a  value. 


